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^ delta file that identifies only the changes between the original file and the revision file (2(X)). 



wo 01/98951 Al lillillliiliiliiiiiillillill 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of (he PCT Gazette. 



wo 01/98951 



PCTAJSOl/19898 

A SYSTEM AND METHOD FOR FILE 
TRANSMISSION USING FILE DIFFERENTIATION 



5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Patent Apphcation 
Serial No. 60/213,502, filed on June 22, 2000, and entitled 'DELTAMAN", which is 
incorpdrated by reference herein in its entirety. 



10 FIELD OF THE INVENTION 

The present invention relates to a method and system for updating files, and 

more particularly, relates to a method and system for efficiently synchronizing remote 

files using file differentiation. 



15 BACKGROUND OF THE INVENTION 

In many business environments, a server is used to store data that is pertinent 

to many employees or remote users of a business. The server is typically accessible by 
remote computer systems ("clients") to increase the availability of information to the 
remote users. By providing files on a server, which may be accessed by remote 
20 computer systems, dissemination of information through the company is increased. 
Remote access to a file is more critical in environments where a sales force or many 
employees operate away firom the oflBice. As an example, the remote employees rely 
on the information contained within the file to be informed about inventory changes. 
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pricing data, and company events. Rather than remain connected to the server 
indefinitely and collect telecommunication charges or tie up phone lines, the remote 
users only intermittently connect their computers to a server for access to the files on 
the server. In these environments, the remote computer systems typically store the 

5 server file locally to support the remote ^plication even when the client is not 

coimected to the server. The intermittent connection is then used to send only changes 
made by the cHent application to the server and a pertinent set of changes firom the 
server to the client. This type of remote computer system environment is called an 
Intermittently Coimected (EC) ©aviromnent ICs have a wide variety of applications in 

10 sales force automation, insurance claim processing, and mobile work forces in general 
anwhere there are mobile users. 

An important commimication issue for this type of computer environment is 
the timely and efficient exchange of information between the clients and the server. 
The term "file transfer'* is often used to describe the process of maintaining data 

15 consistency and integrity among server files and cUent files. There are many 

synchronization schemes for maintaining consistency. In some known file transfer 
schemes, various protocols and methods, for example compression to e£Eiciently 
transfer files, are used. 

Thus, heretofore an unaddressed need exists in the industry to address the 

20 aforementioned deficiencies in downloading files to a system quickly and efficiently. 
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SUMMARY OF THE INVENTION 
The invention provides a system and method for efficiently transferring files 
using file differentiation. The invention may be conceptualized as a file 
differentiation system that includes a client device with a device file, a server device 
5 containing an original file and a revision file of the original file, and a delta file that 
identifies only the changes between the original file and the revision file. 

The invention may also be conceptualized as a method for efiSciently 
transferring files using file differentiation, the method comprising the steps of: (1) 
providing an original file; (2) creating a revision file of the original file; and (3) 
10 generating a delta file that identifies only the changes between the original file and the 
revision file. 

BRIEF DESCRimON OF THE DRAWINGS 
The present invention^ as defined in the claims, can be better understood with 

reference to the following drawings. The components within the drawings are not 

15 necessarily to scale relative to each other, emphasis instead being placed upon clearly 

illustrating the principles of the present invention. 

FIG. 1 is a block diagram illustrating the network environment in which a 

computing device including the file difference synchronization system 100 of the 

present invention, 

20 FIG. 2 is a block diagram illustrating an example of a computer system 

utilizing the file difference synchronization system 100 of the present invention. 
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FIGS. 3A and 3B are flow charts collectively illustrating an example of the 
process flow of the file difference synchronization system 100 of the present 
invention, as shown in FIG. 2. 

FIG. 4 is a flow chart illustrating an example of traversal routine, as shown in 
5 FIG. 3A, operating with the file difference synchronization system 100 of the present 
invention. 

FIG. S is a flow chart illustrating an example of deletechild routine, as shown 
in HG. 3B, operating with the file difference synchronization system 100 of the 
present invention. 

10 FIG. 6 is a flow chart illustrating an example of insertchild routine, as shown 

in FIG. 3B, operating with the file difference synchronization system 100 of the 
present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
15 The invention to be described hereafter is applicable to all file transfer systems 

using a file differentiation system in the present indention to maintain remote file 

synchronizatioiL While described below with respect to a single computer, the system 

and method for a file difference synchronization system 100 is typically implemented 

in a networked computing arrangement in which a number of computing devices 

20 communicate over a local area network (LAN), over a wide area network (WAN), or 

over a combination of both LAN and WAN. 

Referring now to the drawings, in which like numerals illustrate like elements 

throughout the several views, Fig. 1 illustrates the basic components of an intermittent 
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connected file transfer system ("ICFT*^ 10 used in connection with the preferred 
embodiment of the present invention. The system 10 includes client systems 16a, 
16b, and 16c. Each client has 8?)plications and a local file 15a, 15b, and 15c. A 
computer server 14 contains appUcations and a server file 15d that are accessed by 

5 client systems 16(a-c) via intermittent connections 13(a-c), respefctively, over network 
12. The server 14 runs administrative software for a computer network and controls 
access to part or all of the network and its devices. The client systems 16(a-c) share 
the server data stored at the computer server 14 and may access the server 14 over a 
network 12, such as but not Umited to: the Internet, a local area network (LAN), a 

10 wide area network (WAN), via a telephone line using a modem or other like networks. 

The server 14 may also be connected to the local area network (LAN) within an 

I 

organization. 

The structure and operation of the ICFT system 1 0 enables the server 14 and 
the server file 15d associated therewith to handle clients more efficiently than 

15 previously known systems. Particularly, the present invention provides a manner of 
organizing data of the server file into updates that enable a remote chent system to 
update its remote file more efficiently. Periodically, a modification ("delta" or 
**update") file is created for each client with all relevant changes since the last 
modification file creation. When the cUents systems 16(a-c) connect to the server 14, 

20 the modification files associated with the chent are transmitted to the client to be used 
for updating each cUenfs individual files. 

The chent systems 16a" 16c may each be located at remote sites. Thus, when a 
user at one of the remote chent systems 16(a-c) deskes to be updated with the current 
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information &om the shared file at the server 14, the client system 16(a-c) 
communicates over the network 12, such as but not limited to WAN, intemet, or 
telephone lines to access the server 14. Advantageously, the present invention 
provides a system and methqd for updating cUent systems to most efficiently transfer 

5 their remote files with the file ISD on the server 14. Periodically, the server 
determines the data that has changed for each client since the last evaluation, and 
records those changes in a modification file. When a cUent coimects to the server, it 
requests the modification files for the cUent, creates the dowidoaded modification 
files, and updates its local file. 

10 Hence, the present invention provides for a more eflScient approach to 

maintaining synchronization of remote client files, hi this approach, the server 14, 
compares an original file with a revision of the file, and generates a delta modification 
file which describes the changes that need to be made to the original file to create the 
revised file on the client 16. This delta or modification file has been transmitted to the 

15 remote user, where the changfes, as described in the delta modification file, will be 
applied to the remote users copy of the original file to create the revised file. 

Generally, the file difference synchronization method will go along comparing 
bytes in both files, as long as they match, the count is increased, which will be the 
amount for a skip record. When there is a mismatch, a token's worth of bytes at tiie 

20 mismatch point is grabbed from both the original file and revision file. With the 

token from the original file, there is an attempt to find that token in the revision file. If 
the matching token is found, it is called a "sync" and there is an assumption that there 
was an insert. Likewise, with the token from the revision file there is an attempt to 
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find it in the original file. If a match is found, it is called a "sync" and thwe is an 
assumption that there has been a delete. If neither is found, then it is assumed that 
there was a replace of one byte, advance both file pointers, grab tokens from both files 
and continue to look for a sync point. 
5 The method for merging the delta with the original file on the client system 16 

(A-C) will read a record from the delta, and then will do one of the three things: (1) 
copy bytes from the original file to the new (an unchanged region); (2) skip over bytes 
in the original file, not copying them (a deletion in the original); or (3) copy bytes 
from the deltas to the new revised file (and insert into the original insert). 

10 The delta modification file will be made up of records, each having a type, 

followed by a length and some having data following. There are four types of records: 
(1) skip - indicating a match region, no data follows; (2) delete - indicates a portion of 
the original that needs to be deleted, no data follows; (3) insert - inserts bytes into the 
original, the data to be inserted follows; or (4) replace - a combination of delete and 

15 insert. 

For example, consider the following two strings. 
ABCDEFGm and ABCxxxDE 

If we consider the first to be the original and the second to be the revision, the 
delta would be: 

20 SKIPS ^ Copy the ABC fix)m original to revision 

nsrSERT 3 XXX Add xxx to the revision 
SKIP 2 Copy DE from the original to the revision 

DELETE 4 Dorft copy FGHI 
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Generally, in tenns of hardware architecture, as shown in FIG. 2, the 
computers 14 & 16 include a processor 41, storage 42 memory 42, and one or more 
input and/or output (I/O) devices (or poipherals) that are communicatively coupled 
via a local interface 43, The local interface 43 can be, for example but not limited to, 

5 one or more buses or other wired or wireless connections, as is known in the art. The 
local niterface 43 may have additional elements, which are omitted for simplicity, 
such as controllers, buffers (caches), drivers, repeaters, and receivers, to enable 
communications. Further, the local interface 43 may include address, control, and/or 
data connections to enable appropriate communications among the aforementioned 

10 components. 

The processor 41 is a hardware device for executing software that can be 
stored in memory 42. The processor 41 can be virtually any custom made or 
commercially available processor, a central processing unit (CPU) or an auxiliary 
processor among several processors associated with the computer 14 & 16, and a 

1 5 semiconductor based microprocessor (in the form of a microchip) or a 

macroprocessor. Examples of suitable conMnercially available microprocessors are as 
follows; an 80x86 or Pentium series microprocessor from Intel Corporation, U.S.A., a 
PowerPC microprocessor from IBM, U.S.A., a Sparc microprocessor from Sun 
Microsystems, Ihc, a PA-RISC series microprocessor from Hewlett-Packard 

20 Company, U.S.A., or a 68xxx series microprocessor from Motorola Corporation, 
U.S.A. 

The memory 42 can include any one or combination of volatile memory 
elements {e.g. , random access memory (RAM, such as dynanuc random access 
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memory (DRAM), static random access memory (SRAM), etc,)) and nonvolatile 
memory elements (e.g., ROM, erasable programmable read only memory (EPROM), 
electronically erasable programmable read only memory (EEPROM), programmable 
read only memory (PROM), tape, compact disc read only memory (CD-ROM), disk, 
5 diskette, cartridge, cassette or the like, etc.). Moreover, the memory 42 may 

incorporate electronic, magnetic, optical, and/or other types of storage media. Note 
that the memory 42 can have a distributed architecture, where various components are 
situated remote from one another, but can be accessed by the processor 41 . File 1 5 
resides in memory 42. 

10 The software in memory 42 may include one or more separate programs, each 

of which comprises an ordered Usting of executable instructions for implementing 
logical functions. In the example of FIG. 2, the software in the memory 42 includes a 
suitable operating system (0/S) 52 and the file difference synchronization system 100 
of the present invention. 

1 5 A non-exhaustive list of examples of suitable commercially available 

operating systems 52 is as follows: a Windows operating system from Microsoft 
Corporation, U.S.A., a Netware operating system available from Novell, Inc., U.S.A., 
an operating system available from IBM, Inc., U.S.A., any LINUX operating system 
available from many vendors or a UNIX operating system, which is available for 

20 purchase from many vendors, such as Hewlett-Packard Company, U.S.A., Sun 
Microsystems, Inc. and AT&T Corporation, U.S.A. The operating system 52 
- essentially controls the execution of other computer programs, such as the file 
difference synchronization system 100, and provides scheduling, input-output control. 
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file and data management, memory management, and conrnmnication control and 
related services. However, it is contemplated by the inventors that the file difference 
synchronization system 100 of the present invention is applicable on all other 
commercially available operating systems. 

5 The file difference synchronization system 1 00 may be a source program, 

executable program (object code), script, or any other entity comprising a set of 
instructions to be performed. When a source program, then the program is usually 
translated via a compiler, assembler, interpreter, or the hke, which may or may not be 
included within the memory 42, so as to operate properly in connection with the 0/S 

10 52. Furthermore, the file difference synchronization system 100 can be written as (a) 
an object oriented progranaming language, which has classes of data and methods, or 
(b) a procedure prograinming language, which has routines, subroutines, and/or 
functions, for example but not limited to, C, C+ +, Pascal, BASIC, FORTRAN, 
COBOL, Perl, Java, and Ada. 

15 The I/O devices may include input devices, for example but not limited to, a 

keyboard 45, mouse 44, scanner (not shown), microphone (not shown), etc. 
Furthermore, the I/O devices may also include output devices, for example but not 
limited to, a printer (not shown), display 46, etc. Finally, the I/O devices may fiirther 
include devices that communicate both inputs and ou^uts, for instance but not limited 

20 to, a NIC or modulator/demodulator 47 (for accessing other files, devices, systems, or 
a network), a radio fi:equency (RF) or other transceiver (not shown), a telephonic 
interface (not shown), a bridge (not shown), a router (not shown), etc. 
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If the computer 14 & 16, is a PC, workstation, or the like, the software in the 
memory 42 may further include a basic input output system (BIOS) (omitted for 
simplicity). The BIOS is a set of essential software routmes that initialize and test 
hardware at startup, start the 0/S 52, and support the transfer of data among the 
5 hardware devices. The BIOS is stored in ROM so that the BIOS can be executed 
when the computer 14 & 16 is activated. 

When the computer 14 & 16 is in operation, the processor 41 is configured to 
execute software stored within the memory 42, to communicate data to and from the 
memory 42, and to generally control operations of the computer 14 & 16 pursuant to 
10 the software. The file difference synchronization system 100 and the 0/S 52 are read, 
in whole or in part, by the processor 41, perhaps buffered within the processor 41, and 
then executed. 

When the file difference synchronization system 100 is implemented in 
software, as is shown in FIG. 2, it should be noted that the file difference 

15 synchronization system 1 00 can be stored on virtually any computer readable medium 
for use by or in connection with any computer related system or method. In the 
context of this document, a computer readable medium is an electronic, ms^etic, 
optical, or other physical device or means that can contain or store a computer 
program for use by or in connection with a computer related system or method. The 

20 file difference synchronization system 100 can be embodied in any computer-readable 
medium for use by or in coimection with an instruction execution system, apparatus, 
or device, such as a computer-based system, processor-containing system, or other 
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system that can fetch the instractions from the instruction execution system, 
apparatus, or device and execute the instructions. 

In the context of this document, a "computer-readable medium" can be any 
means that can store, communicate, propagate, or transport the program for use by or 

5 in connection with the instruction execution system, apparatus, or device. The 
computer readable medium can be, for example but not limited to, an electronic, 
magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, 
device, or propagation medium. More specific examples (a nonexhaustive list) of the 
computer-readable medium would include the following: an electrical connection 

10 (electronic) having one or more wires, a portable computer diskette (magnetic), a 
random access memory (RAM) (electronic), a read-only memory (ROM) (electronic), 
an erasable programmable read-only memory (EPROM, EEPROM, or Flash memory) 
(electronic), an optical fiber (optical), and a portable compact disc read-only memory 
(CDROM) (optical). Note that the computer-readable medium could even be paper or 

15 another suitable medium upon which the program is printed, as the program can be 
electronically captured, via for instance optical scanning of the paper or other 
medium, then compiled, interpreted or otherwise processed in a suitable maimer if 
necessary, and then stored in a computer memory. ' 

In an alternative embodiment, where the file difference synchronization system 

20 100 is implemented in hardware, the file difference synchronization system 100 can 
be implemented with any one or a combination of the following technologies, which 
are each well known in ttie art: a discrete logic circuit(s) having logic gates for 
implementing logic fimctions upon data signals, an application specific integrated 
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circuit (ASIC) having ^propriate combinational logic gates, a programmable gate 
array(s) (PGA), a field programmable gate array (FPGA), etc. 

Illustrated in Figures 3 A and 3B are flow charts collectively depictmg an 
example of the process flow of the file difference synchronization system 100 of the 
5 present invention, as shown in FIG. 2. It is in the 

First at step 101, the file difference synchronization system 100 is initialized. 
Next at step 102, the file difference synchronization system 100 determines if the 
original file 15 or the revision file 19 have either reached an end of file status (EOF). 
If it is determined at step 102 that neither the original file 15 or revision file 19 have 
10 reached an end of file (EOF) status, then the file difference synchronization system 
100 proceeds to step 103 to determine if the difference tree is at a max depth. 

If it is determined at step 103 that the difference tree is at a max depth, then 
the file diflference synchronization system 100 performs the traversal routine at step 
104. The traversal routine is herein defined in further detail with regard to Fig. 4. 
15 After performing the traversal routine at step 1 04, the file difference synchronization 
system 100 then returns to repeat step 102. However, if it is determined at step 103 
that the difference tree is at a max depth, then the file difference synchronization 
system 100 then proceeds for further processing at step 121 with regard to Fig. 3B. 

At step 121 (Fig. 3B), the file difference synchronization system 100 initiates a 
20 new leaf node and then sets a skipcounter to zero at step 122. At step 123 the file 
difference synchronization system 1.00 then determines if the current byte of the 
original file 15 matches a cunrotit byte of the revision file 19. If it is determined at 
step 123 that the current byte of the original file 15 matches the current byte of the 
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revision file 15, then the file dijQference synchronization system 100 increments the 
skipcoimter at step 124 and returns to repeat step 123. This situation occurs as long as 
the data in the original file 15 and revision file 19 match. However, if it is determined 
at step 123 that the current byte of the original file 15 does not match the current byte 

5 of the revision file 19, then the file difiference synchronization system 100 adds a 
skiprecord to the leaf node using the current skipcounter value at step 125. This 
situation occurs as long as the data in the original file 15 and revision file 19 do not 
match. At step 126, the file difference synchronization system 100 then sets the 
replace coimter to zero. 

10 At step 127 the file difference synchronization system 100 determines if the 

original file 15 and revision file 19 are synced for an insertion and deletion. In order 
to determine if an insertion and deletion has taken place when a mismatch occurs in 
the comparison of the original file 15 and revision file 19, scanning is preformed in 
both the original file 15 and revision file 19 to search for the next common block of 

1 5 data. If a data segmrat of the origmal file 1 5 is found when scaiming forward in the 
revision file 19, it is assumed that an insertion has occurred. In addition, if a data 
segment of the revision file 19 is found when scanning forward in the original file 15, 
it is assumed that a deletion has occurred 

If it is determined at step 127 that the original file 15 and revision file 19 are 

20 synced for an insertion and deletion, then the file difference synchronization system 
100 then returns to step Fig. 3 A. The original file 15 and revision file 19 are synced 
when the binary tree has created both a delete node and an insertion node. However, if 
it is determined at step 127 that the original file 15 and revision file 19 are not synced 
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for an insertion and deletion, the file difference synchronization system 100 then 
determines if the token is matched for a deletion at step 131. The original file 15 and 
revision file 19 are not synced when flie binary tree has created only a delete node or 
an insertion node. If it is determined at step 131 that the original file 15 and revision 

S file 19 are matched for deletion, then the file difference synchronization system 100 
then performs the deletechild routine at step 132. The deletechild routine is herein 
described in further detail with regard to Fig. 5. After performing the deletechild 
routine at step 132, tiie file difference synchronization system 100 then proceeds to 
step 139 and returns to Fig. 3 A. 

10 However, if it is determined at step 131 that the original file 15 and revision 

file 19 are not matched for deletion, then the file difference synchronization system 
100 then determines if the token is matched for an insertion at step 133. If it is 
determined at step 133 that the token is matched for insertion, the file difference 
synchronization system 100 then performs the insertchild routine at step 134. The 

1 5 insertchild routine is herein defined m further detail with regard to Fig. 6. After 
performing the insertchild routine at step 134, the file difference synchronization 
system 100 returns to Fig. 3 A. 

Notwithstanding, if it is detennined at step 133 that the tokens are not matched 
for an insertion, then tiie file difference synchronization system 100 advances a 

20 current position in both the original file 15 and revision file 19 at step 135. At step 
136 the file difference synchronization system 100 increments the replacement 
counter and returns to repeat steps 127 through 136. After completing further 
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processing the fiile dijBFerence synchronization system 100 then returns to Fig. 3A to 
repeat step 102. 

If however, it is determined at step 102 that either the original file 15 or 
revision file 19 has incurred an end of file status (EOF), then the file difference 

5 synchronization system 1 00 calculates, the path cost at step 111. The path cost is 
calculated by traversing each path from root to leaf. The path cost is based upon the 
size of the delta. At step 1 12, the file difference synchronization system 100 traverses 
the least cost path (i.e., the path with the smallest delta) writing out records in each 
node to the difference file 200! 

10 At step 1 13, the file difference synchronization system 100 then determines if 

the end of file (EOF) status has been reached for the revision file 19 (Fig. 2). If it is 
determined at step 113 that the end of file (EOF) status for the revision file has not 
occurred, the file difference synchronization system 100 then writes the insert record 
to the difference file 200 (Fig. 2) with a remainder of the original file 15 as an insert 

15 striiig at step 114. After writing the insert record at step 1 14, the file difference 
synchronization system 100 then exits at step 119. 

However, if it is determined at step 113 that the end of file (EOF) status for the 
revision file 19 has occurred, then the file difference synchronization system 100 then 
counts the bytes remaining in the original file at step 115. At step 1 16, the file 

20 difference synchronization system 100 writes a delete record to the difference file 200 
with the count of the bytes remaining in the original file at step 1 16. The file 
difference synchronization system 100 then exits at step 119. 
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Illustrated in Fig. 4 is a flow chart illustrating an example of traversal routine 
140, as shown in FIG. 3A, operating with the file difference synchronization system 
100 (Figs. 3A and 3B) of the present invention. The traversal routine picks a tree 
depth, and once reached, writes the root node to the delta, finds the best path fi:om 

5 root to leaf node, and then throw away {i.e. prunes) the root plus the other half of the 
tree (promoting either the pRoot->delete or pRoot->insert to pRoot), and goes again. 
This way the tree is kept at a specified depth, thus keqping memory allocation to a 
known amount, as well as keeping down the amount of time spent scanning. 

First, the traversal routine 140 is initiaUzed at step 141 . At step 142 the 

10 traversal routine 140 writes the contents of the root node to the diflFerence file 200 
(Fig. 2). Next at step 143, the traversal routine 140 traverses each path fi*om the root 
to each leaf calculating the path costs of each traversal. 

At step 144, the traverse routine 140 determines if the least cost path was a 
child of a roofs delete subtree. If it is determined at step 144 that the least cost path 

15 was not a child of a root delete subtree then the traversal routine 140 makes the root 
node's insert child into the new root at step 151. At step 152, the traversal routine 140 
deletes the old root node's delete subtree and then proceeds to step 156. However, if 
it is determined at step 144 that the least cost path was a child of a ropfs delete 
subtree, then the traversal routine 140 makes the root node's delete child the new root 

20 at step 153. At step 154, the traversal routine 140 deletes the old root node's insert 
subtree. At step 156 the traversal routine 140 then deletes the old root node and exits 
the trav^al routine at step 159. 
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Illustrated in Fig. 5 is a flowchart illustrating aa example of the deletechild 
routinel60, as shown in Fig. 3B, operating with the file difference synchronization 
system 100 (Figs. 3A and 3B) of the present invention. First the deletechild routine 
160 is initialized at step 161. At step 162, the deletechild routine 160 creates a new 

5 node deletechild. The deletechild routine 1 60 then adds the delete record to the 
deletechild at step 163. At step 164 , the deletechild routine 160 determines if the 
replace counter is greater than zero. If it is determined at step 164 that the replace 
counter is not greater than zero, the deletechild routine 160 then proceeds to step 166. 
However, if it is determined at step 166 that the replace counter is greater than zero, 

10 then the deletechild routine 160 adds the replace record to the deletechild using the 
value of the replace counter at step 165. 

At step 166, the deletechild routine 160 makes a deletechild a child of the leaf 
node on the delete side, and advances the pointer to the original file 15 and the pointer 
to the revision file 19 at step 167. At step 168, the deletechild routine increments the 

15 replace counter and exits at step 169. 

Illustrated in Fig. 6 is a flowchart illustrating an example of the insertchild 
routine 1 80, as shown m Fig. 3B, operating with the file difference synchronization 
. system 100 (Figs. 3A and 3B) of the present invention. First at step 181, the 
insertchild routine is initialized. At step 182, the insertchild routine 180 creates a new 

20 node insertchild. At step 1 83, an insert record is added into the insertchild. At step 
184, the insertchild routine 180 determines if the replacement counter is greater than 
zero. If it is determined at step 1 84 that the replacement counter is not greater than 
zero, then the insertchild routine 180 then proceeds to step 186. However, if it is 
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detennined at step 184 that the replaced counter is greater thaa zero, then the 
insertchild routine 180 adds the replace record to the insertchild using the value of the 
replace counto: at step 185. 

At step 186, the insertchild routine 180 makes the insertchild a child of the leaf 
5 node on the insert side and advances the origmal file pointer and the revision file 
pointer at step 187. At step 188, the insertchild routine 180 increments the 
replacement counter and then exits at step 189. 

It will be apparent to those skilled in the art that many modifications and 
variations may be made to embodiments of the present invention, as set forth above, 
10 without departing substantially from the principles of the present invention. All such 
modifications and variations are intended to be included herein within the scope of the 
present invention, as defined in the claims that follow. 
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CLAIMS 

What is claimed is: 

1 . A method for transmitting a file that is modified, comprising the steps 

of: 

5 providing an original file; 

creating a revision file of the original file; and 

generating a delta file that identifies only the changes between the original file 
and the revision file. 



10 2. The method of Claim 1, wherein said step of generating a delta file 

fiurther comprises the step of: 

creating a binary tree to identify the changes between the original file and the 
revision file. 

15 3 . The method of Claim 2, wherein said step of generating a delta file 

fiirther comprises the step of: 

determining a least cost path in the binary tree. 

4. The method of Claim 2, wherein said step of generating a delta file 
20 further comprises the steps of: 

comparing each block of data in the original file with each block of data in the 
revision file; and 
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generating a node in tiie binary tree for each mismatch of each block of data in 
the original file and each block of data in the revision file. 



5. The method of Claim 4, further comprising the step of : * 

5 transmitting the delta file to a device so the device can recreate the revision 

file using only the delta file and a device original file. 

6. A system for transmitting a file that is modified, comprising: 
a chent device with a device file; 

10 a server device containing an original file and a revision file of the original 

file; and 

a delta file that identifies only the changes between the original file and the 
revision file. 

15 7 The system of Claim 6, wherein the client device fiuther comprises: 

a synchronization module that create the revision file on the client device 
using the delta file and a device file. 

8. The system of Claim 6, wherein the server device fiirther comprises: 
20 a transmission module that transmits the deUa file to the cUent device so the 

client device can recreate the revision file. 



9. The system of Claim 6, wherein the server device finlher coixq)rises: 
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a compare module that compares each block of data in the original file with 
each block of data in the revision file. 



1 0. The system of Claim 6, wherein the server device fiirfher comprises: 
5 a tree module that creates a binary tree to identify the changes betwem the 

original file and the. revision file; and 

a node module that generates a node in the binary tree for each mismatch of 
each block of data in the original file and each block of data in the revision file. 



10 1 1 . A computer readable medium for a logic that transmits a file that is 

modified, comprising: 

logic for creating an original file; 

logic for creating a revision file of the original file; and 

logic for generating a delta file that identifies only the changes between the 

15 original file and the revision file. 

12. The computer readable medium of claim 1 1 , wherein the logic for 
generating fiirther comprises: 

logic for creating a binary tree to identify the changes between the original file 
20 and the revision file. 
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13. The computer readable medium of claim 12, wherein the logic for 
generating further comprises: 

logic for determining a least cost path in the binary tree. 

5 14. The computer readable medium of claim 12, wherein the logic for 

generating further comprises: 

logic for comparing each block of data in the original file with each block of 
data in the revision file; and 

logic for generating a node in the binary tree for each mismatch of each block 
10 of data in the original file and each block of data in the revision file. 

15. The computer readable medium of claim 11, wherein the logic for 
generating further comprises: 

logic for transmitting the delta file to a device so the device can recreate the 
15 revision file using only the delta file and a device original file. 

16. A system for transmitting a file that is modified, comprising: 
means for creating an original file; 

means for creating a revision file of the original file; and 
20 means for generating a delta file that identifies only the changes between the 

original file and the revision file. 
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17. The system of Claim 16, further comprises: 
means for creating a binary tree to identify the changes between the original 
file and the revision file. 

5 18. The system of Claim 16, further comprises: 

means for determining a least cost path in the binary tree. 

19. The system of Claim 16, further comprises: 

means for comparing each block of data in the original file with each block of 
10 data in the revision file; and 

means for generating a node in the binary tree for each mismatch of each block 
of data in the original file and each block of data in the revision file. 

20. The system of Claim 16, further comprises: 

15 means for transmitting the delta file to a device so the device can recreate the 

revision file using only the delta file and a device original file. 
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